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1. Describe In detail what the components of the Invention are and tiow the invention works:* 

INVENTION 

As long as computer generated graphics are represented by a finite number of screen pixels, there will be some visual 
anomalies called the jaggies or staircasing, a phenomenon known as aliasing. The application of techniques that reduce 
aliasing is called anti-aliasing, of which full-scene anti-aliasing via supersampling is one of these techniques. Other anti- 
aliasing methods include sub-pixel computations, edge blending, and color accumulation. 

Supersampling is a simple approach to full-scene anti-aliasing in which the original scene is rendered at a higher 
resolution and then filtered down to the original screen resolution. This in effect raises the Nyqulst limit, which simply shifts 
the aliasing effect up to a higher spatial frequency. Though the technique does not eliminate aliasing completely, the 
method » simple and widely used by rQ8*y-30pjBpJiicsac^ today. However, there are some performance 
dnmbsda when this technique is employed, namely the extra processing, and memory storage and bandwidth required to 
render the Image at fctmes the onglnal resolution and later filtered down. Supersampling two times in each of x and y 
directions (*M) results In four time of processing, storage, and bandwidth. This patent describes an efficient 
implementation of supersampling without incurring extra memory storage and bandwidth by using a tile-based rendering 
architecture in conjunction with a unified graphics cache. ^ 

ll a ^" based . rer ^ eril ^ a £ hitecture witn a «ni««J graphics cache model, t h&jplor and dep th values for pixels inside 
5- „l a ^ q ^-! Qj ^ ra ^ h .!f! Colorgrj wtltiQn of the unifie d. c a ch e (Flgum i)" 1 tie tile size Is determined based on 

^ ^ °" na ^ stze oi the "" ified 9 ra P h l<* cache. For Instance, a 64KB unified graphics cache can 
axjommoda e a tile size of 128x64 pixels, each pixel consisted of 32bit color and depth values. Besides color and depth 
data, the unified graphics cache also stores the texture data annual 
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Figure 1: A L2 Sharing Model in an integrated microprocessor using tile-based 3D rendering architecture 
^Sm- ^-te^^^S architecture, each tile is render one at a time. The "Graphics Color/Z* cache is large enough 
£USK n !!? dta, ?. e0,0r ."I 2 d l te a ^ e8SeS f0r a " trian 9 tes that m hBUe the color and Z data are written^ 

rnmn^,l o 6 ? e T .1™ f er after J 88 ' trian 9 le in the tile ,inishes rendering. All pixels in the tile are then considered 
complete and will not be rendered again. The purpose of this invention is to exploit the benefit of a tite-based erdEtore 
.n conjunction wrth the micro-architectural features of a unified graphics cache to perform supersampling effk^nUy 

I5.S22 nl nS Wt ™ Relent implenientation of supersampling by eliminating the extra memory storage and bandwidth 
T c " S,n ? ^ ""'''f 1 9 ?P h,CS ***** modeL R 9 ure 2 b6l0w Ascribes the data flow whenVupersampling is 
S^5L2!2K?^ "ft*? k r 4 ' S A a ^ med supersampling 2* in both X and Y directions. The technique is 
SSSfSESSt.* .* l Val l JBS - ,hQ "Physical" tile size is assumed to be 128x64. and polygons are 

software-binned Into a "virtual" tile size of 64x32. K-wiraoic 
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r^pw Diagram for Efficient Supersampling Using Tile-Based Rendering with a Unified Graphics Cache 

1. Polygons, already binned into tiles, are received in their original forms into the graphics core, but are internally amplified 
4x of its original size. This is achieved through the viewport transformation supported by the graphics setup engine, 

2. The enlarged polygons are tile-based rendered Into the "GFX C/Z Tile Buffer* in the unified graphics cache (red line 
#1). Texture data for polygons being rendered are accessed from the "GFX Texture Cache" if hits occur to the cache 
(pink line #2). 

3. After the last triangle in the tile finishes rendering, the "GFX C/Z Tile Buffer" contains the complete image of the tile that 
is 4x of its original tile size. 

4. A stretch BLT is performed to down sample the image from the virtual tile down to the physical tile size. This is 
accomplished by rendering a rectangle (made up of two polygons) of the size equal to that of the physical tile size. The 
supersampled image in the virtual tile (still stored in the M GFX C/Z Tile Buffer") is considered the source of the stretch 
BLT, while the destination is allocated in the external memory. Internally, the micro-architecture treats the source of 
the stretch BLT as a texture map for the destination rectangle (blue line #3). The "GFX Texture Cache" is kept 
undisturbed to maintain good utilization of the texture data across tiles. 

5. While the stretch BLT is occurring and its results being written out to the "External Memory" (green line #4), rendering 
of the next tile can begin in the pipelined engine. 

ADVANTAGES 

Using a tile-based rendering and a unified graphics cache architecture, efficient anti-aliased (via supersampling) images 
can be created without increasing the external memory storage and bandwith requirements. This is achieved through the 
use of the unified graphics cache that provides a temporary storage for the supersampled image to be later filtered down 
(through streatch BLT). In this way, only the final Image of the original size needs to be written out and stored in the 
external frame buffer. On the contrary, a non-tiled based rendering engine must first render the entire supersampled 
image to a memory surface of the size of k times of the original screen resolution before the downsampling can occur. 
Typically, this memory is too large to be implemented on-chip which leads to an increase in the memory storage and 
bandwidth requirements. 

This invention is valuable to an integrated microprocessor targeting the Value PC segment where cost is a primary 
concern. Because of the cost constraint, the memory subsystem bandwidth is often limited due to the number of memory 
channels available. By avoiding extra memory bandwidth and storage requirements, an Integrated microprocessor can 
take advantages of the unified graphics cache and the tile-based rendering architecture to render images with full-scene 
antialiasing feature enabled with a minimal performance penalty. 
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